
arXiv.org e-Print Archive

RNA secondary structure prediction from multi-aligned sequences

It is well accepted that the secondary structures of most functional non-coding RNAs (ncRNAs) are closely related to their functions and are conserved during evolution. Hence, prediction of conserved secondary structures from evolutionarily related sequences is an important task in RNA bioinformatics; the methods are useful not only for further functional analyses of ncRNAs but also for improving the accuracy of secondary structure predictions and for finding novel functional RNAs in the genome. In this review, I focus on common secondary structure prediction from a given alignment of RNA sequences, in which one secondary structure whose length is equal to that of the input alignment is predicted. I systematically review and classify existing tools and algorithms for the problem, by the information employed in the tools and by adopting a unified viewpoint based on maximum expected gain (MEG) estimators. I believe that this classification will allow a deeper understanding of each tool and provide users with useful information for selecting tools for common secondary structure predictions.

Comment: A preprint of an invited review manuscript that will be published in a chapter of the book "Methods in Molecular Biology". Note that this version of the manuscript may differ from the published version.

CiteSeerX

Thermodynamically based DNA strand design

Author: Andronescu Mirela
Chang Seo Bong
Condon Anne
Hoos Holger H.
Shortreed Michael R.
Smith Lloyd M.
Tulpan Dan
Publication venue: Oxford University Press
Publication date: 01/01/2005

We describe a new algorithm for the design of strand sets, for use in DNA computations or universal microarrays. Our algorithm can design sets that satisfy any of several thermodynamic and combinatorial constraints, which aim to maximize desired hybridizations between strands and their complements while minimizing undesired cross-hybridizations. To heuristically search for good strand sets, our algorithm uses a conflict-driven stochastic local search approach, which is known to be effective in solving comparable search problems. The PairFold program of Andronescu et al. [M. Andronescu, Z. C. Zhang and A. Condon (2005) J. Mol. Biol., 345, 987–1001; M. Andronescu, R. Aguirre-Hernandez, A. Condon, and H. Hoos (2003) Nucleic Acids Res., 31, 3416–3422.] is used to calculate the minimum free energy of hybridization between two mismatched strands. We describe new thermodynamic measures of the quality of strand sets. With respect to these measures of quality, our algorithm consistently finds, within reasonable time, sets that are significantly better than previously published sets.
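The conflict-driven stochastic local search idea mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the thermodynamic constraints and PairFold are replaced here by a single combinatorial constraint (a minimum pairwise Hamming distance), and the names `design_strands`, `conflicts`, `min_dist` and `noise` are hypothetical choices made for this example.

```python
import random

BASES = "ACGT"

def hamming(a, b):
    """Number of positions at which two equal-length strands differ."""
    return sum(x != y for x, y in zip(a, b))

def conflicts(strands, min_dist):
    """Index pairs of strands that violate the minimum-distance constraint."""
    return [(i, j)
            for i in range(len(strands))
            for j in range(i + 1, len(strands))
            if hamming(strands[i], strands[j]) < min_dist]

def design_strands(n, length, min_dist, max_steps=10000, noise=0.2, seed=0):
    """Search for n strands whose pairwise Hamming distance is >= min_dist.

    Returns the strand set and the number of constraint violations left
    (0 means every pair satisfies the constraint).
    """
    rng = random.Random(seed)
    strands = ["".join(rng.choice(BASES) for _ in range(length))
               for _ in range(n)]
    for _ in range(max_steps):
        bad = conflicts(strands, min_dist)
        if not bad:
            break
        # Conflict-driven step: only strands involved in a violation are modified.
        i = rng.choice(rng.choice(bad))
        pos = rng.randrange(length)
        candidates = [strands[i][:pos] + b + strands[i][pos + 1:] for b in BASES]
        if rng.random() < noise:
            # Random-walk move, which helps escape local minima.
            strands[i] = rng.choice(candidates)
        else:
            # Greedy move: keep the base that leaves the fewest violations.
            def cost(s):
                trial = strands[:i] + [s] + strands[i + 1:]
                return len(conflicts(trial, min_dist))
            strands[i] = min(candidates, key=cost)
    return strands, len(conflicts(strands, min_dist))
```

A call such as `design_strands(6, 8, 4)` returns six 8-mers together with the number of remaining violations. A thermodynamic variant would, under the same scheme, replace `hamming` with a free-energy estimate of cross-hybridization.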

CiteSeerX

arXiv.org e-Print Archive

Target prediction and a statistical sampling algorithm for RNA-RNA interaction

Author: Akutsu
Alkan
Andronescu
Argaman
Bachellerie
Banerjee
Benne
Bernhart
Busch
Chitsaz
Chitsaz
Christian M. Reidys
Ding
Dirks
Dowell
Fenix W. D. Huang
Geissmann
Giegerich
Hekimoglu
Hofacker
Huang
Jing Qin
Kugel
McCaskill
McManus
Mneimneh
Mückstein
Mückstein
Narberhaus
Pervouchine
Peter F. Stadler
Qin
Rehmsmeier
Rivas
Salari
Tacker
Tafer
Tjaden
Udekwu
Urban
Zuker
Publication date: 05/08/2009

It has been proven that the accessibility of target sites has a critical influence on miRNA and siRNA. In this paper, we present a program, rip2.0, which computes not only the energetically most favorable target sites based on the hybrid probability, but also a statistical sample of structures that illustrates the statistical characterization and representation of the Boltzmann ensemble of RNA-RNA interaction structures. The outputs are retrieved by backtracing an improved dynamic programming solution for the partition function, based on the approach of Huang et al. (Bioinformatics). The O(N^6) time and O(N^4) space algorithm is implemented in C (available from http://www.combinatorics.cn/cbpc/rip2.html).

Comment: 7 pages, 10 figures

Permanent Hosting, Archiving and Indexing of Digital Resources and Assets

Free energy estimation of short DNA duplex hybridizations

Author: A Panjkovich
AV Vologodskii
C Schmidt
CA Gelfand
CL Clark
D LeBlanc
Dan Tulpan
F Aboul-ela
F Barbault
F Seela
F Tanaka
GA Leonard
GE Plum
HT Allawi
HT Allawi
HT Allawi
HT Allawi
IL Hofacker
J Petruska
J Petruska
J SantaLucia
J SantaLucia
J SantaLucia
JS McCaskill
KJ Breslauer
L Ratmeyer
LE A
M Andronescu
M Andronescu
M Andronescu
M Zuker
MC Pirrung
Mirela Andronescu
MJ Doktycz
MJ Doktycz
N Peyret
N Sugimoto
N Sugimoto
N Sugimoto
N Tibanyenda
O Gotoh
P Wu
R Owczarzy
R Owczarzy
RZ Gharaibeh
S Bommarito
S Delcourt
S Nakano
S Wuchty
Serge Leger
WD Wilson
Y Li
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract
Background: Estimation of DNA duplex hybridization free energy is widely used for predicting cross-hybridizations in DNA computing and microarray experiments. A number of software programs based on different methods and parametrizations are available for the theoretical estimation of duplex free energies. However, significant differences in free energy values are sometimes observed among estimations obtained with various methods, making it difficult to decide which value is accurate.
Results: We present in this study a quantitative comparison of the similarities and differences among four published DNA/DNA duplex free energy calculation methods and an extended Nearest-Neighbour Model for perfect matches based on triplet interactions. The comparison was performed on a benchmark data set of 695 pairs of short oligos that we collected and manually curated from 29 publications. Sequence lengths range from 4 to 30 nucleotides and span a large GC-content percentage range. For perfect matches, we propose an extension of the Nearest-Neighbour Model that matches or exceeds the performance of the existing ones, both in terms of correlations and root mean squared errors. The proposed model was trained on experimental data with temperature, sodium and sequence concentration characteristics that span a wide range of values, giving the model greater power of generalization when used for free energy estimations of DNA duplexes under non-standard experimental conditions.
Conclusions: Based on our preliminary results, we conclude that no statistically significant differences exist among free energy approximations obtained with four publicly available and widely used programs, when benchmarked against a collection of 695 pairs of short oligos collected and curated by the authors of this work from 29 publications. The extended Nearest-Neighbour Model based on triplet interactions presented in this work is capable of performing accurate estimations of free energies for perfect match duplexes under both standard and non-standard experimental conditions and may serve as a baseline for further developments in this area of research.
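For orientation, the standard doublet Nearest-Neighbour calculation that this abstract takes as its starting point can be sketched as follows. This is not the triplet-based extension proposed in the paper. The parameter values are an assumption: they are recalled from the unified ΔG°37 set of SantaLucia (1998) for 1 M NaCl and should be checked against the original table before any real use. The function name `duplex_dg37` is hypothetical.

```python
# Assumed unified nearest-neighbour free energies at 37 °C in kcal/mol
# (recalled from SantaLucia 1998; verify before relying on them).
NN_DG37 = {
    "AA": -1.00, "AT": -0.88, "TA": -0.58, "CA": -1.45, "GT": -1.44,
    "CT": -1.28, "GA": -1.30, "CG": -2.17, "GC": -2.24, "GG": -1.84,
}
INIT_GC = 0.98    # initiation penalty for a terminal G·C pair
INIT_AT = 1.03    # initiation penalty for a terminal A·T pair
SYMMETRY = 0.43   # correction for self-complementary duplexes

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def duplex_dg37(seq):
    """Estimate the free energy of a perfect-match duplex formed by seq
    (written 5'->3') and its reverse complement."""
    seq = seq.upper()
    dg = 0.0
    # Sum the stacking contribution of each adjacent base-pair doublet.
    for k in range(len(seq) - 1):
        step = seq[k:k + 2]
        # Each doublet is tabulated once; the other strand's reading of the
        # same stack is its reverse complement.
        dg += NN_DG37[step] if step in NN_DG37 else NN_DG37[revcomp(step)]
    # One initiation term per duplex end.
    for end in (seq[0], seq[-1]):
        dg += INIT_GC if end in "GC" else INIT_AT
    if seq == revcomp(seq):
        dg += SYMMETRY
    return round(dg, 2)
```

With these assumed parameters, `duplex_dg37("CGTTGA")` evaluates to -5.35 kcal/mol: five stacking terms (-7.36 in total) plus the two initiation terms (0.98 + 1.03).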

NRC Publications Archive

Multifunctional materials for bone cancer treatment

Author: Andronescu Ecaterina
Ferreira Jose M. F.
Ficai Anton
Ficai Denisa
Marques Catarina
Sonmez Maria
Publication venue: 'Dove Medical Press Ltd.'
Publication date: 01/01/2014

The purpose of this review is to present the most recent findings in bone tissue engineering. Special attention is given to multifunctional materials based on collagen and collagen-hydroxyapatite composites used for skin and bone cancer treatments. The multifunctionality of these materials was obtained by adding suitable components to the base regenerative grafts, such as ferrites (magnetite being the most important representative), cytostatics (cisplatin, carboplatin, vincristine, methotrexate, paclitaxel, doxorubicin), silver nanoparticles, antibiotics (anthracyclines, geldanamycin), and/or analgesics (ibuprofen, fentanyl). The suitability of complex systems for the intended applications was systematically analyzed. The developmental possibilities of multifunctional materials with regenerative and curative roles (antitumoral as well as pain management) in the field of skin and bone cancer treatment are discussed. It is worth mentioning that better materials are likely to be developed by combining conventional and unconventional experimental strategies.

Repositório Institucional da Universidade de Aveiro

Prediction of RNA secondary structure by maximizing pseudo-expected accuracy

Author: B Knudsen
C Do
D Mathews
H Kiryu
I Hofacker
I Holmes
IL Hofacker
JS McCaskill
K Sato
Kengo Sato
Kiyoshi Asai
L Carvalho
L Kall
M Andronescu
M Andronescu
M Hamada
M Hamada
M Hamada
M Hamada
M Parisien
M Zuker
M Zuker
MC Frith
Michiaki Hamada
N Michal
P Baldi
PP Gardner
R Durbin
RK Bradley
RK Bradley
S Bernhart
S Engelen
S Griffiths-Jones
S Gross
S Seemann
SJ Schroeder
Y Ding
Y Ding
Y Ding
ZJ Lu
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract
Background: Recent studies have revealed the importance of considering the entire distribution of possible secondary structures in RNA secondary structure predictions; therefore, new types of estimators have been proposed, including the maximum expected accuracy (MEA) estimator. The MEA-based estimators have been designed to maximize the expected accuracy of the base-pairs and have achieved the highest level of accuracy. Those methods, however, do not give a single best prediction of the structure, but employ parameters to control the trade-off between the sensitivity and the positive predictive value (PPV). It is unclear what parameter value we should use, and even the well-trained default parameter value does not, in general, give the best result in popular accuracy measures for each RNA sequence.
Results: Instead of using the expected values of the popular accuracy measures for RNA secondary structure prediction, which are difficult to calculate, the pseudo-expected accuracy, which can easily be computed from base-pairing probabilities, is introduced. It is shown that the pseudo-expected accuracy is a good approximation in terms of sensitivity, PPV, MCC, or F-score. The pseudo-expected accuracy can be approximately maximized for each RNA sequence by stochastic sampling. It is also shown that secondary structures that are well balanced between sensitivity and PPV can be predicted with a small computational overhead by combining the pseudo-expected accuracy of MCC or F-score with the γ-centroid estimator.
Conclusions: This study gives not only a method for predicting the secondary structure that balances between sensitivity and PPV, but also a general method for approximately maximizing the (pseudo-)expected accuracy with respect to various evaluation measures including MCC and F-score.
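One reading of the pseudo-expected accuracy described here is that the counts of true positives, false positives and false negatives in an accuracy measure are replaced by their expected values under the base-pairing probabilities. The sketch below follows that reading; it is an illustration rather than the authors' implementation, and the function name and the input format (a dictionary mapping index pairs to probabilities) are assumptions made for this example.

```python
def pseudo_expected_scores(pair_probs, predicted):
    """Pseudo-expected sensitivity, PPV and F-score of a predicted structure.

    pair_probs: dict mapping (i, j) index pairs to base-pairing probabilities.
    predicted:  set of (i, j) pairs in the candidate secondary structure.
    """
    # Expected true positives: probability mass on the predicted pairs.
    tp = sum(pair_probs.get(p, 0.0) for p in predicted)
    # Expected false positives: predicted pairs weighted by (1 - probability).
    fp = len(predicted) - tp
    # Expected false negatives: probability mass on pairs left unpredicted.
    fn = sum(q for p, q in pair_probs.items() if p not in predicted)
    sen = tp / (tp + fn) if tp + fn > 0 else 0.0
    ppv = tp / (tp + fp) if tp + fp > 0 else 0.0
    f = 2 * tp / (2 * tp + fp + fn) if tp + fp + fn > 0 else 0.0
    return {"SEN": sen, "PPV": ppv, "F": f}
```

For example, with probabilities `{(0, 5): 0.9, (1, 4): 0.8, (2, 3): 0.1}` and predicted pairs `{(0, 5), (1, 4)}`, the expected counts are TP = 1.7, FP = 0.3 and FN = 0.1. Candidate structures, for instance those obtained with different trade-off parameter values, could then be ranked by the resulting F value.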

RNAalifold: improved consensus structure prediction for RNA alignments

Abstract
Background: The prediction of a consensus structure for a set of related RNAs is an important first step for subsequent analyses. RNAalifold, which computes the minimum energy structure that is simultaneously formed by a set of aligned sequences, is one of the oldest and most widely used tools for this task. In recent years, several alternative approaches have been advocated, pointing to several shortcomings of the original RNAalifold approach.
Results: We show that the accuracy of RNAalifold predictions can be improved substantially by introducing a different, more rational handling of alignment gaps, and by replacing the rather simplistic model of covariance scoring with more sophisticated RIBOSUM-like scoring matrices. These improvements are achieved without compromising the computational efficiency of the algorithm. We show here that the new version of RNAalifold not only outperforms the old one, but also several other tools recently developed, on different datasets.
Conclusion: The new version of RNAalifold not only can replace the old one for almost any application but it is also competitive with other approaches including those based on SCFGs, maximum expected accuracy, or hierarchical nearest neighbor classifiers.

Fraunhofer-ePrints

ViennaRNA Package 2.0

Author: A Busch
A Sczyrba
A Waugh
AJ Enright
AR Gruber
AR Gruber
AR Gruber
B Kaczkowski
B Knudsen
B Matthews
C Aksay
C Flamm
C Flamm
C Flamm
C Höner zu Siederdissen
CB Do
Christian Höner zu Siederdissen
Christoph Flamm
D Sankoff
D Thirumalai
D Upper
DA Benson
DH Mathews
DH Mathews
DH Mathews
DH Turner
EP Nawrocki
H Kiryu
H Tafer
H Tafer
H Tafer
H Tafer
Hakim Tafer
I Tinoco Jr
I Tinoco Jr
IL Hofacker
IL Hofacker
IL Hofacker
IL Hofacker
IL Hofacker
IL Hofacker
IL Hofacker
IL Hofacker
IL Hofacker
Ivo L Hofacker
J Hertel
J Hertel
J Reeder
J SantaLucia
JA Jaeger
JH Havgaard
JN Zadeh
JN Zadeh
JS McCaskill
JS Reuter
K Darty
K Reiche
L Dagum
L He
M Andronescu
M Andronescu
M Andronescu
M Fekete
M Hamada
M Höchsmann
M Kalaš
M Larkin
M Parisien
M Rehmsmeier
M Tacker
M Zuker
M Zuker
M Zuker
MS Andronescu
MS Waterman
NR Markham
P Gardner
P Gardner
P Schuster
Peter F Stadler
R Dowell
R Klein
R Lorenz
R Nussinov
R Nussinov
R Thadani
RA Dimitrov
RE Bruccoleri
Ronny Lorenz
RR Stocsits
S Bernhart
S Bonhoeffer
S Heyne
S Washietl
S Will
S Wuchty
S Zakov
SH Bernhart
SH Bernhart
SM Freier
SR Eddy
Stephan H Bernhart
T Xia
U Mückstein
V Rusinov
W Beyer
W Fontana
W Fontana
W Fontana
W Fontana
W Pearson
Y Ding
Y Ding
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract
Background: Secondary structure forms an important intermediate level of description of nucleic acids that encapsulates the dominating part of the folding energy, is often well conserved in evolution, and is routinely used as a basis to explain experimental findings. Based on carefully measured thermodynamic parameters, exact dynamic programming algorithms can be used to compute ground states, base pairing probabilities, as well as thermodynamic properties.
Results: The ViennaRNA Package has been a widely used compilation of RNA secondary structure related computer programs for nearly two decades. Major changes in the structure of the standard energy model, the Turner 2004 parameters, the pervasive use of multi-core CPUs, and an increasing number of algorithmic variants prompted a major technical overhaul of both the underlying RNAlib and the interactive user programs. New features include an expanded repertoire of tools to assess RNA-RNA interactions and restricted ensembles of structures, additional output information such as centroid structures and maximum expected accuracy structures derived from base pairing probabilities, or z-scores for locally stable secondary structures, and support for input in fasta format. Updates were implemented without compromising the computational efficiency of the core algorithms and ensuring compatibility with earlier versions.
Conclusions: The ViennaRNA Package 2.0, supporting concurrent computations via OpenMP, can be downloaded from http://www.tbi.univie.ac.at/RNA.

Fraunhofer-ePrints